Adaptive sample size determination for the development of clinical prediction models
نویسندگان
چکیده
Abstract Background We suggest an adaptive sample size calculation method for developing clinical prediction models, in which model performance is monitored sequentially as new data comes in. Methods illustrate the approach using diagnosis of ovarian cancer ( n = 5914, 33% event fraction) and obstructive coronary artery disease (CAD; 4888, 44% fraction). used logistic regression to develop a consisting only priori selected predictors assumed linear relations continuous predictors. mimicked prospective patient recruitment by on 100 randomly patients, we bootstrapping internally validate model. added 50 random patients until reached 3000 re-estimated at each step. examined required satisfying following stopping rule: obtaining calibration slope ≥ 0.9 optimism c-statistic (or AUC) < 0.02 two consecutive sizes. This procedure was repeated 500 times. also investigated impact alternative modeling strategies: nonlinear correcting bias estimates (Firth’s correction). Results Better discrimination achieved (c-statistic with 7 predictors) than CAD 0.7 11 predictors). Adequate limited after median 450 (interquartile range 450–500) (22 events per parameter (EPP), 20–24) 850 (750–900) (33 EPP, 30–35). A stricter criterion, requiring AUC 0.01, met (23 EPP) 1500 (59 respectively. These sizes were much higher well-known 10 EPP rule thumb slightly recently published fixed Riley et al. Higher when relationships modeled, lower Firth’s correction used. Conclusions Adaptive determination can be useful supplement calculations, because it allows tailor specific context dynamic fashion.
منابع مشابه
Sample size determination for logistic regression
The problem of sample size estimation is important in medical applications, especially in cases of expensive measurements of immune biomarkers. This paper describes the problem of logistic regression analysis with the sample size determination algorithms, namely the methods of univariate statistics, logistics regression, cross-validation and Bayesian inference. The authors, treating the regr...
متن کاملComparing two Formulas of Sample Size Determination for Prevalence Studies
Background and purpose: Sample size and its determination is one of the most important problems in health researches. Calculating sample size for prevalence studies is one of the common questions of sample size topics. Minimum sample size with least complexity is desirable in order to achieve the basic goal of these studies. This study aims to compare two formulas of sample size calculation for...
متن کاملBayesian Sample size Determination for Longitudinal Studies with Continuous Response using Marginal Models
Introduction Longitudinal study designs are common in a lot of scientific researches, especially in medical, social and economic sciences. The reason is that longitudinal studies allow researchers to measure changes of each individual over time and often have higher statistical power than cross-sectional studies. Choosing an appropriate sample size is a crucial step in a successful study. A st...
متن کاملA mixed Bayesian/Frequentist approach in sample size determination problem for clinical trials
In this paper we introduce a stochastic optimization method based ona mixed Bayesian/frequentist approach to a sample size determinationproblem in a clinical trial. The data are assumed to come from a nor-mal distribution for which both the mean and the variance are unknown.In contrast to the usual Bayesian decision theoretic methodology, whichassumes a single decision maker, our method recogni...
متن کاملBayesian Determination of Sample Size in Longitudinal Studies with Binary Responses Using Random Effects Models
Sample size determination is important in all statistical studies including longitudinal studies. This is usually done by considering a target power to reduce the costs of sampling. Choosing the right sample size using efficient methods, ensures that the researcher achieve goal of the study, by spending the least amount of energy, time and money. In this article, using a method based on simulat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Diagnostic and prognostic research
سال: 2021
ISSN: ['2397-7523']
DOI: https://doi.org/10.1186/s41512-021-00096-5